This role offers an opportunity to join one of the most exciting AI companies in San Francisco. With founders hailing from MIT and Stanford, this world-class team is on a mission to make the next breakthrough in generative AI applications.
As a Senior AI Infrastructure Engineer, you’ll be at the cutting edge of technology, building a next-generation, ultra-resilient, global, and multi-cloud platform powered by open-source technologies. Your work will drive the acceleration of AI innovation, enabling rapid scaling and unlocking new possibilities for groundbreaking AI-driven projects.
Requirements:
- 5+ years of software development experience with expertise in at least one backend language.
- Proven success in designing and building distributed cloud architectures, microservices, and large-scale systems across platforms like AWS, Azure, or GCP.
- Expert knowledge of operating systems, from multi-threading to memory management, networking, storage, performance, and scalability.
- Highly organized, methodical, and ready to take the initiative in solving complex challenges.
- Experience with Kubernetes, containerization, AI workloads, and decentralized technologies is a big plus.
- Familiarity with GPU programming, NCCL, and CUDA is a bonus.
- 5+ years writing production-quality, high-performance code.
Responsibilities:
- Drive the architecture and research behind decentralized AI workloads, shaping the future of AI infrastructure.
- Contribute to the core platform, focusing on innovation and open-source contributions.
- Develop new services, tools, and resources that empower developers and enhance the platform.
- Build cutting-edge testing frameworks to ensure reliability, fault tolerance, and scalability across the entire platform.
This is a chance to be part of something transformative: building the foundation for the future of AI technologies, working alongside brilliant minds, and pushing the boundaries of what’s possible. Are you ready to make your mark?